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Abstract 

In this work, we study a noiseless broadcast link serving K users whose requests arise from a library of N 
files. Every user is equipped with a cache of size M files each. It has been shown that by splitting all the files into 
packets and placing individual packets in a random independent manner across all the caches, it requires at most 
N/M file transmissions for any set of demands from the library. The achievable delivery scheme involves linearly 
combining packets of different files following a greedy clique cover solution to the underlying index coding problem. 

This remarkable multiplicative gain of random placement and coded delivery has been established in the asymptotic 
regime when the number of packets per file F scales to infinity. 

In this work, we initiate the finite-length analysis of random caching schemes when the number of packets F 
is a function of the system parameters M, N, K. Specifically, we show that existing random placement and clique 

A shorter version of this manuscript appeared in the 52nd Annual Allerton Conference on Communication, Control, and Computing (Allerton), 
2014 as an invited paper (TJ. 
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cover delivery schemes that achieve optimality in the asymptotic regime can have at most a multiplicative gain of 
2 if the number of packets is sub-exponential. Further, for any clique cover based coded delivery and a large class 
of random caching schemes, that includes the existing ones, we show that the number of packets required to get a 
multiplicative gain of is at least 0{{N/MY). We exhibit a random placement and an efficient clique cover based 
coded delivery scheme that approximately achieves this lower bound. We also provide tight concentration results that 
show that the average (over the random caching involved) number of transmissions concentrates very well requiring 
only polynomial number of packets in the rest of the parameters. 

Index Terms 

Coded multicasting; Caching; Index Coding; Clique-cover; Finite-length analysis. 

I. Introduction 

Wireless data traffic is increasing at an alarming pace dominated by video on demand services El, and the LTE 
bandwidth available has not increased to cope with the increasing demand. Recently, to tackle this problem, caching 
at the wireless edge has been proposed 0, 11. Caching could take place at small cellAViFi access points or at 
end user devices 0, 0, by prefetching popular content at off-peak periods to alleviate peak traffic later. It has 
been shown that in the presence of some form of communication between caches (e.g., device-to-device or D2D 
communications), caching gains proportional to the aggregate cache size can be obtained 0 , 0 . However, when 
there is no direct communication between caches, conventional caching schemes are limited to local cache hit gains. 

Consider that a set of demands for distinct files arriving at the base station (in a wireless macro cell setting) 
where each demand corresponds to some user mobile device in the cell. For simplicity, consider the case when 
user mobile stations are equipped with cache memory. Files stored in (say) user A’s cache may or may not be 
relevant to that user’s demand. However, it is possible that another user’s demand is stored in the cache. This could 
benefit the number of files (or its equivalent in terms of bits) that the base station needs to broadcast to satisfy all 
demands if the cache content of every user is taken into account. The abstract problem called index coding tries to 
model the aspect of wireless caching systems that do not have local cache hits but their cache (or what is called 
side information) overlaps with other users’ demands. In an index coding problem, we have K caching mobile 
devices served by a noiseless broadcast channel. Fach caching device requests a distinct file that is not there in 
its cache. The broadcast transmissions can be a linear combination of files. Fach user recovers its demand using 
the broadcast transmissions using its cache content. The objective is to find the minimum broadcast transmissions 
(termed as broadcast rate) given a set of demands and given cache content for each user. This problem is known to 
be extremely hard to compute and approximate even when linear combinations are done over the binary alphabet 
0. The problem can be represented as a side information graph where vertices are users and a directed edge means 
that one user’s caching device has the some other users’ desired packet as cached information. This problem has 
received a lot of attention in the information theory literature ifTOl - lfT^ because it 1) encapsulates the difficulty of 
all network coding problems and 2) any linear coding scheme for index coding is equivalent to a linear interference 
alignment scheme. We provide an example : User 1 requests packet 1 and User 2 requests packet 2 and each user 
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has the other users’ packet. Although there is no local cache hit, the side information present at both users can be 
used to reduce the number of transmission by 1 by transmitting the XOR of both packets. 

In another line of work, motivated by this ’index coding property’ that allows using users side information to create 
coded multicast transmissions for users requesting different files, the problem of designing the side information is 
also considered. This problem is referred to as either the coded caching or the caching-aided coded multicasting 
problem. Hereafter, we refer to this simply as the caching problem. The setting is same as the index coding problem 
where there is a library of N files from which user requests arise and every device has a memory of size M. The 
difference is that there is a placement phase, which is free of cost, that involves populating all user caches with files 
from the library. The problem has been studied where order optimal peak broadcast rate for worst-case demand, order 
optimal average rate for uniform demand distribution and order optimal average rate for Zipf demand distribution 
for the demand have been characterized. However, all the achievable schemes work in the asymptotic regime when 
the number of packets per file scales to infinity. In this paper, we consider the case of peak rate over worst-case 
demand pattern. We show that existing algorithms for placement and delivery give very little gain even when the 
file sizes are exponentially large in the number of users, and derive lower bounds for a general class of random 
uncoordinated placement schemes and clique cover based delivery schemes. We also modify existing algorithms to 
approximately match these bounds. A detailed review of the caching problem is given below. 

A. Related Work 

In the caching problem, there is a common broadcasting agent serving K users through a noiseless broadcast 
channel. Every user requests a file from a set of N files. Each file consists of F bits or packets. Every user has 
a cache of size M files. Eiles or parts of it (’packets’) are placed in every cache prior to transmissions assuming 
that the library of file requests is known in advance. The objective is to design a placement scheme and delivery 
scheme that optimizes (or approximately optimizes) the maximum number of file transmissions required over all 
possible demand patterns. This problem has been well studied in the asymptotic regime when F ^ oo. 

A deterministic caching and delivery scheme which requires packets per file to achieve a gain of KM/N 

was proposed in na. Eollowing this, a random placement scheme that allows populating user caches independently 
of each other was proposed in m. In this uncoordinated placement phase, every user caches MF/N packets of 
each file n € [1 : N] chosen uniformly at random and independently of other caches. The delivery scheme is a 
greedy clique cover on the side information graph induced by the underlying index coding problem (refer Section 
nil, where a set of packets of possibly different files are XORed if for all packets, at least one user desiring the 
file corresponding to the packet can recover the desired packet only by using its cache contents. Eor example if 
A + B + C was sent, a user wanting A could recover A if the user has B and C stored in its user cache. The peak 
broadcast rate (number of file transmissions) of this scheme was shown to be (in the limit F oo): 

^ 

Here, Rp{M) denotes the peak broadcast rate. The peak rate means the worst case rate over all demand patterns of 
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the K users from the library. More precisely, this is the average peak rate because it is averaged over the randomness 
in caching. Note that, if coded multicasting is not used then the rate is given by itT (1 — M/N) from the gain due 
to just local cache hits. It was shown through cut-set bounds that the result in ([T]i was optimal up to a constant 
factor. The placement and delivery algorithms that achieve this average peak rate are given in Algorithms [T| and |2] 
respectively. 


Input: Parameters K, M, N and F. 
for every user fc G [1 : iT] do 
for every file n € [1 : A^] do 

I Choose a random MF/N subset of F packets of file n and place it in cache k. 

end 

end 

Output: Cache configuration for every user k € [1 : K], 

Algorithm 1: UldPlacement (Placement Algorithm in lll4ll ) 


Input: Parameters K, M, N and F, caches for all users k € [1 : K] and demand set d = [di,d 2 ... 

for every subset <S C [1 : AT] do 

Let Vk,s-k be the vector of packets from file requested by user k but stored exactly in the set of caches 
S-k. 

Transmit ®k(^sVk,s-k- 

end 

Algorithm 2: OldDeliverv (Delivery Algorithm in lll4i l. XORing (ffl’ingl vectors of different lengths means 
that all shorter vectors are zero padded to match the longest and then XORed. 

This was followed by the works of ifTSll and ifT^ where they analyze the case of average number of transmissions 
when the user demand follows a popularity distribution over the library. Specifically, authors in ifTbll consider the 
case in which file requests follow a Zipf popularity distribution. They provide caching and delivery schemes that 
achieve order optimal average number of transmissions in the asymptotic regime. The caching distribution, unlike 
in the worst-case, has to be designed with respect to the collective demand distribution. Interestingly, they also 
showed that for Zipf parameter between 0 (uniform popularity) and 1, even the peak rate scheme given above is 
sufficient for order optimality in the asymptotic regime F ^ oo. 

B. Our Contribution 

We consider the caching problem with K users, N files in the library and a cache size of M files. We are 
interested in the peak broadcast rate (number of file transmissions) for the worst-case demand. Our contributions 


are: 
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1) We first show that the existing random uncoordinated placement scheme (Algorithm [T]i for this problem and its 

delivery scheme (Algorithm |2]i has a rate above vvhen F < exp {KM/N). When compared 

to the asymptotic result, for a large asymptotic gain when KN/M is n((logAr)^), the file size requirement 
is super-polynomial. 

2) We propose a slightly modified placement scheme (Algorithm O. We show that the old delivery algorithm 
(Algorithm |2]i coupled with the new placement scheme has similar file size requirements suggesting a needed 
change in the delivery scheme. 

3) We show that, under any random placement scheme which is independent and symmetric across users (every 

file packet placement in a user cache is independent of its placement in other caches, every file packet has 
equal marginal probability of being placed in a cache), any clique cover based scheme (using clique cover 
on the side-information graph) requires a file size of approximately for achieving a peak 

average rate of ^ (1 — M/N). Here, the average is over the random caching involved. 

4) Since the studied placement schemes are random, it is important to consider the spread in performance due 
to this randomness. We show that the file size requirements for any clique cover scheme over both random 
placement schemes (old and the new) is polynomial for the average number of transmissions to concentrate 
for any demand pattern. It is sufficient to have a file size of {0{K^ log K)) for the random rate (over the 
randomness in caching) to be within a constant multiplicative factor from the mean. 

5) We finally exhibit a modified delivery scheme that improves on Algorithm |2] through an extra pre-processing 

step. This modified delivery scheme applied with a specific user grouping along with the new placement 
scheme provably achieves a rate of roughly with a file size of 0(([W/M])®+^(log(A^/M))®+^(2e)®) 

approximately matching the lower bound. The new placement scheme plays an important role in simplifying 
the analysis of this algorithm. 

In Section ini we provide the definitions of two random placement schemes (‘old’ placement scheme used in the 
literature and a ‘new’ placement scheme) and a delivery scheme previously used in literature. In Section [Till for 
any clique cover scheme, we show that the file size requirements are only polynomial in K for the normalized 
transmissions in both random placement schemes to concentrate well. In Section IIVI we show that the previous 
delivery scheme, that works asymptotically very well, gives only a constant gain (of 2) even for exponentially large 
file sizes. We also show that any clique cover scheme with a random placement scheme that is ‘symmetric’ requires 
exponential file size in the ‘target gain’. For constant target gains, the file size requirement is polynomial in the 
ratio of library size to the cache memory size per user. In Section [V] to bridge the gap, we design an efficient clique 
cover scheme, which together with the new placement scheme, achieves the file size lower bound approximately 
orderwise. 


II. Definitions and Algorithms 

We consider the problem of designing placement and delivery schemes when K users request files from a library 
of N files (N > K) and each user has a cache of size M files. In the placement phase, a file is divided into F 
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packets/bits. Then each packet is placed in different user caches (randomly or deterministically). We are interested 
in an efficient placement scheme and an efficient delivery scheme consisting of coded XOR transmissions of various 
packets that optimizes the peak rate over worst-case demands. An efficient delivery scheme computes the coded 
transmissions needed in time polynomial in parameters N, K, F, M, while a placement scheme being efficient 
ensures F is as small as possible. Let us denote a set of demands by d = [di, ^2 ... dx] , dfc £ [1 : iV]. A packet 
/ belonging to file n £ [1 : A^] is denoted by (n, /). 

Definition 1. After a placement scheme, cache configuration C is given by the family of sets Snj for all files n 
and I F f < F where Snj C [1 : AT] is the set of user caches in which packet (bit) f of file n is stored. 

Every demand d and a cache configuration induces a directed side information graph G = (V, E) where there 
are KF nodes where (dk, /) is the label for each node representing the /-th packet of file dk- There is a directed 
edge from (dk, /) to (dj, /') if the file packet f of file dj is stored in the user cache k. 

Definition 2. A clique cover delivery scheme corresponds to covering nodes of G by cliques. A clique is a set of 
vertices where there are edges in either direction between all vertices. 

It is easy to see that, XORing all the packets in the clique formed by (dki, fi), {dk^, / 2 ) • ■ • {dk,,,, fm) implies 
that user kj,l < j < m will be able to decode the packet {dk^, ff) by using all other packets in the XOR from its 
cache. Note that, here we do not require the demands to be distinct. 

Let d) be the number of normalized transmissions (total number of bits broadcast divided by file size F) 

achieved by a given generic clique cover scheme A on the side information graph induced by the placement C and 
demand d. In the literature, sometimes i?(C,d) is also called broadcast rate or simply rate. We replace A by a 
short italicized string to denote various algorithms. 

A. New Placement and Delivery Schemes 

We first provide our new placement and delivery schemes in Algorithms |3] and 0] that forms the basis of all our 
results. The new placement scheme simplifies analysis and helps us to get concentration results. The new delivery 
scheme is just an efficient polynomial time (in all the parameters) implementation of the old delivery scheme. 

Remark: Xd^j in Algorithm 0] refers to the content of packet / of the file dk. Also, it is easy to see that 
Algorithm 0] runs in time polynomial in K and F. 

Let (C,d) denote the normalized transmissions achieved by Algorithm 0] Here, the string nd denotes the 
delivery scheme in Algorithm 0] Here, A in d) is replaced by a string nd to denote Algorithm 0] Let d^ 

denote a set of distinct demand requests by users, i.e. every user requests a distinct file. Let R°E (C, d) denote the 
number of normalized transmissions under the optimal clique cover scheme on the side information graph due to 
the cache configuration C and the demand pattern d. 

When C is chosen randomly, i?”‘^(C,d) is a random variable. Let Ec denote expectation taken over the cache 
configuration according to a specified random placement described by the string c. Further, let E^ denote expectation 
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Input: Parameters if, M, N and F. 

Let F = \N/M~\F' packets and F' is an integer. Let every file be divided into F' groups each of size 
\N/M~\ each. 

for every user k £ [1 \ K\ do 
for every file n € [1 : A^] do 
for /' € [1 : F'] do 

/'-th packet of file n in user fc’s cache is randomly uniformly chosen from the set of \^~\ packets 
of group /' of file n. 

end 

end 

end 

Output: Cache configuration for every user k £ [1 : K], 

Algorithm 3: NewPlacement 


Input: Parameters K, M, N and F, caches for all users k £ [1 : K] and demand set d = [di,d 2 ... 

Let (7 = 0. Let Sd^j C [1 : K], V/c S [1 : A'], f £ [I : F] he the exact subset of users in which the /-th 
packet of file requested by user k is stored. 

Let D C [1 : AT] X [1 : A] be the file packets that are stored in the user requesting the corresponding file, i.e. 

D = {{dkj) : k £ Sd,j}. 

for (4,/)e[l:iL]x[l:A]-Ado 

if (4,/) ^ C then 
Let A = %. 

for j G [1 : A'] — m do 

if 3(j, /') ^ C for some /' : Sa^j' = Sdk,f[jk - j then 

I 

end 

end 

Transmit Xd^j ©(j,/')eA Xd^j'- 

C^C[j{dkJ)[jA. 

else 

I Proceed with the next iteration. 

end 


Algorithm 4: NewDelivery 










over a demand distribution described by d. Let Ec,d denote the expectation with respect to both. Let Cop denote the 
‘old’ random placement according to Algorithm[T] Let c„p denote ‘new’ random placement according to Algorithm 

E] 

We first note that our new delivery scheme performs identically to Algorithm |2] It is an efficient implementation 
of the old one. 

Theorem 1. The number of transmissions of Algorithm^ is identical to the number of transmissions of Algorithm 
|2]/or a given placement and a set of demands. 

Proof: It is easy to see from the description that Algorithm |4] is an efficient way to implement Algorithm |2] 

■ 

Even the new placement scheme is very similar to the old placement scheme except that it reduces lots of 
unwanted correlations between different packets belonging to the same file. This helps us simplify analysis. 

III. Concentration results 

A. New Placement Scheme 

Now, we state Theorem |2] about concentration of i?(C, d) around its mean for the two placement algorithms. 
Please note that, the concentration results hold for any delivery algorithm that provides a clique cover on the side 
information graph induced by C. Therefore, we do not specify the algorithm used and hence we drop A in R^{C, d). 
Before that, we provide a standard technical lemma regarding concentration of martingales. 

Lemma 1. (Azuma-Hoeffding, McDiarmid) Consider a random variable Z = f(Xi,X 2 ...Xn) where /(•) is 
a real-valued function and Xi,X 2 ...X„ are n random variables. Then {V\Z\Xq, Xi, X 2 ... XiW^^Q forms a 
martingale. Here, Xq is taken to be a constant. Suppose, these random variables satisfy either one of the following: 
1 ) 

|E[Z|Xi,X 2 ...Xi]-E[Z|Xi,X 2 ...X,_i]| <c, (2) 

2) (Average Lipschitz Condition) 

\E[Z\Xi,X 2 ...X, = a]-E[Z\Xi,X 2 ...X^ = a']\<c^ (3) 

3) (McDiarmid’s Inequality) Suppose the set of random variables {Xi} are independent. 

\f{Xi,X2,...X,,...Xn)-f{Xi,X2,...Xi...Xr, )| <c, (4) 

Then, the following concentration result holds: 



Pr(|Z-E[Z]| >t)< 2exp 


(5) 
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Theorem 2. For the random placement (denoted by string Cnp) given in Algorithm any demand distribution 
denoted by string d (including a singleton distribution on a specific demand) and for any clique cover delivery 
scheme, we have: 


Prc„,,d (|i?(C, d) - [R{C, d)] I > eE,„^,4i?(C, d)]) < 2 exp - 


2e2 (E,„^,4i?(C,d)])V' 


K 


Ar2 

JP 


(6) 


Proof: We use martingale analysis on a generic clique cover algorithm. We denote any generic clique cover 
algorithm by algorithm A. Clearly, the number of transmissions: 


R (C, d) = h {Sdi,i,Sdi,2 ■ ■ ■ Sdi,F ■ ■ ■ Sdk,3 ■ ■ ■ SdK,F) (7) 

for some function hf) where Sd^j ^[1 K] is the subset of users caches in which the file packet / of file dk is 
cached. In other words, the number of transmissions given C and d is determined fully by specifying Sd^j, Vdfc € 
[1 : K], / S [1 : i^]. Further, Sd^j is dependent on both the cache configuration C and the demand d. 

Now, we apply Lemma [T] with random variables Xd^j set to Sd^j, \/k G [1 : K], / £ [1 : f] and Z is set to 
i? (C, d). Consider the expression for a specific (dk, /): 


Cdu = [R (C, d)| 5i,i, ... Sd,j] - [R (C, d)| 5i,2 •. • 5,, j_i]| 


( 8 ) 


In the first term in dill, let us assume that the choice of Sd^j is consistent with the previous choices of Sig 
... SdkJ-i- In Algorithmic every file is grouped into F' groups each of size [^]. Let us assume that packet / 
of the file dk belongs to group g. The placement of file packets is independent across the groups g. The choice of 
S'dfc,/ affects the placement of at most packets belonging to group g of file dk- Other file packet placements 
are unaffected. Let V = {(dfc,/) : / belongs to group g} be the set of bits in the same group g of file dk- 
Consider a new Algorithm B: 1) Run clique cover algorithm A excluding the file packets in V- 2) Then, transmit 
the file packets in V separately. The file packets in V is not used in Step 1 of algorithm B. Let Rb{C, d) be the 
number of transmissions in Step 1 of Algorithm B. Clearly, the following holds: 

r—1 

RB{C,d)<R{C,d)<RBiC,d) + ^ (9) 

r 


This is because, the first step of Algorithm B employs the same clique cover scheme as Algorithm A and operates 
on a sub-graph induced by the file packets in the system other than V- Therefore, the number of transmissions has 
to be reduced. Further, adding the packets of V in step 2 is a sub-optimal way of improving algorithm A- 

Therefore, both E^^^, [i? (C, d)| 51,2 ■ ■ ■ 5’^^, j = ^4] and [i? (C, d)| 5i,i, 5i,2 ■ ■ ■ are at most 

r TV 1 

away from the performance of Step 1 of algorithm B averaged over their respective cache realizations. 
Further, the performance of Step 1 of algorithm B (i?B(C,d)) is independent of the choice of Sd,, f because the 
possibly affected file packets (in set V) have been removed in Step 1 of algorithm B- Therefore, Cd^j < Vfc 
in Lemma [T] 

Hence, applying Lemma [T] we have: 


(|5(C,d) -Ec„p,d[5(C,d)]| > eEc„^,d[7?(C, d)]) < 2exp - 


2e^ (E,„^,,[fl(C,d)])^ 
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\ ) 


Remark: The above result shows that when F > \ -?- ^ log K, then with probability at least 

1 — 1/A"®, A(C, d) € [(1 — e)E (A(C, d)) , (1 + e)E {R{C, d))]. But, for these algorithms to have a non-trivial gain 
even when A —> cx), KM/N > 1 (see ([T]i). This means that N/M < K. Hence, F = 0{K^ log AT) is sufficient 
for R{C, d) to be below (1 + e)Ec„p,(i (A(C, d)) with very high probability. 


B. Old Placement 

Theorem 3. Under the old placement scheme Cop and any demand distribution on d (including a singleton 
distribution on a specific demand), the number of transmissions for any clique cover scheme satisfies the following 
concentration result: 

Pr,„^,4|A(C,d)-E,„^,4A(C,d)]|>eE,„^,4A(C,d)]) (11) 

Proof: We use a martingale argument as before. Consider a generic clique cover scheme implemented by 
Algorithm A. As before, R{C,d) is a function of {<S'dfc,/}fcG[i:if],/e[i:F] where Sd^j C [1 : A"] is the subset of 
caches in which /-th file packet of file dk is stored. In this proof, Sd^j is with respect to the old placement scheme 
Cop. Consider the following: 


cd,j = [R (C, d)| Ai,i, Ai,2 ... = 5] - E,„^ [R (C, d)| Ai,i, 5 i.2 ■.■Sd,j = 5']| 


( 12 ) 


When placement of file packets ... Sd^j-i are fixed, let nj packets be left among MF/N packets allocated 
for the file requested by user k in user cache j. When 5”^^, j = S' C [1 : AT], let the number of packets left for file 
dk at user j’s cache be rij — ls(j). ls(/) = 1 if / G S and 0 otherwise. In any realization, satisfying the first 
conditioning where Sd^j = S in (fT2Ti . for user cache j, nj — ls{j) packets are randomly chosen from the remaining 
F — f packets belonging to file requested by user k. Similarly, rij — I'gij) packets are randomly chosen from the 
remaining F — f packets belonging to file requested by user k for the second conditioning when S^^, j = S' in 

Call. 


Now, consider the following scheme (delivery+placement scheme) with genie aided transmission as follows: 1) 
Genie provides the packet {dk, /) to all users during decoding but is not placed in any user’s cache. 2) We perform 
the old placement. 3) Since genie provides {dk, f) to all users for ’free’, if j = S immediately after step 2, we 
delete the file packet {dk, f) from caches of users in set S and for all users in S replace it with a new random file 
packet from the remaining file packets of the file dk, different from the ones placed according to the old placement 
including the file packet {dk,f) in step 2. We use Cgp to denote ’genie-aided placement’ as described above. 

Let us contrast this with the old placement: In the old placement when Sd^.,} = S, for every j S S, {dk, f) is 
placed in cache j. In the genie aided case, this space at user j has been taken over by a randomly chosen packet 
from file dk that has not been used by the old placement at user j. Everything else remains the identical to old 
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placement. Genie helps every user get the file packet {dk,f) saving one packet transmission. In addition to the old 
placement, jS”! additional random file bits are stored. This could at most save \S\ < K packet transmissions from 
that of the old placement under any clique covering scheme. It is because every user gets at most one extra packet 
in its cache from file dk that it originally had. Therefore, 


0 < [R (C, d) 1^1.1, ... Sd,j-i,Sdu = S]- [R (Cs, d) 


S]< 


K + 1 
F 


(13) 


Note that, in the above equation, conditioning of Cgp till Sd^j is with respect to step 2 (just immediately after the 
old placement) in the genie aided-placement. 

Now, we view the genie aided-placement using a second view: It is exactly identical to performing the old 
placement except that, only for file dk, for each user a random set of MF/N packets are drawn from F — 1 packets 
that excludes the packet {dk, /). The genie aided placement ignores conditioning Sd^j of the old placement. This 
means that: 


[i? (C, d) ... Sd,j-i,Sdu = S]= E,^^.d [i? (C, d) 1^1,1,... Sdu-i,SdU = 0] 

Clearly, from (fT4li . Ec^^.d [i? (C, d) jS'i,!,... Sd^j-i, Sd^j = S] is independent of S. Therefore, 
cdu = |E,„^,d [R (C, d)| Si,u Si,2 . ■. Sd,,f = S]- E,„^.d [R {C, d)| Si,i, Si ,2 ... Sdu = 5']| 

= |(E,„^.d [R (C, d)| Si,i,Si,2 ... Sd„f = S]- E,^^,d [i? (C, d) ... Sdu-i.Sd„f = 0]) - 
(E,„^,d [R (C, d)| Si,i,Si,2 ... S,,f = S'] - E,^^,d [R (C, d) |5i,i,... 5,./ = 0]) | 


^ K + 1 
- F 


(14) 


(15) 


Justification: (a): This is because of (fl^ and (fT4li . Applying the Average Lipschitz condition in Lemma [T] with 
Cdfc,/, we have the result claimed. ■ 

Remark: As before, when F = {-^K^logK^, the rate R{C,d) under old placement is within (1 + e) about 

the expected value multiplicatively. 


IV. File Size Requirements under new and Old Placements 
A. Requirements for Algorithm 0 under New Placement 

Given any cache configuration C and demand d, according to Theorem [T] the number of transmissions of Algo¬ 
rithmic and Algorithm m have identical number of transmissions. Therefore, the expected number of transmissions 
for Algorithm IC under the new placement algorithm is Ec„p (C, d)]. 

Consider any demand distribution for d. Let 1^^ f ’■1*® indicator that the packet /(!</< of group 

g in file dk is placed exactly in the set 5 C [1 : AT] of caches. Now, we have : 

l‘5| . ^ \K-\S\ 

1 - ^TTTTTT I (16) 






\N/M] 
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Also, for S that contain k, we have; 


F' \N/M^ 

|Vfe,s_fe| = ^ ^ iffc./.fl 

9=1 /=1 


(17) 


\N/M^ 

The variables in different groups are independent. ^ ^ is a bernoulli variable since from every group at 

/=! 

most one file bit is stored in a given cache. 


E 


[N/M] 

l5-fc 

/=1 


= \N/M] 


1 


| 5|-1 


f—1 

I Af I , 


1 - 


1 


[iV/M] 


it'-ISI+l 


= M|5|). 


(18) 


Therefore, \Vk,s-k\ is a binomial random variable with F' trials and probability of success /i. 

The expected number of transmissions for all stages S in Algorithm |2] with respect to the new placement is given 
by: 

_ E[max|14,5_fe|J 

(19) 


E[max|14,5_fe|] 

^ I Ml 


In the limit when F' oo, the binomial random variables are concentrated at their mean and therefore, 

m(I<5|) A A 1 ' 


lim E[i?”'^(C,d)] « 


F—¥oo 


5^0 


^ I M I 




1 - 1 - 


f—1 

I M I 


RpiM) 


( 20 ) 


Now, let us consider the case of distinct demands by all users, denoted by d„. We assume that N > K here. 
We are interested in the question; How far are Ec„p d„)] and the limiting peak rate Rp {M) for hnite FI 

Let Bi {n,p) be the binomial distribution for n trials and probability of success p. When the demands of all users 
are different, |14,5_fc| is distributed according to Bi(T"',/i(|5|)) and for different k, the binomial random variables 
are independent. Now, we show that coding gain is roughly at most 2, even when F' is exponential in the targeted 


gain t = 


K 

ThT 


Theorem 4. Let N > K. Then, Ec„p (C,du)] >\[l-^)K when F < (l - exp (2t (l - (l - ^))- 

Proof: 


E 




max|I4,s_fe| 

k^S 


max Yfe 

Ve _ 


F' 


n^Bi(i^',/i(|N|)), Y, = J2Ykj 

/=! 


S^ti 


5/0 


Pr ( U Ykj > 0 

>fee5./e[l:A’'] y 


F'\N/M] 


E Pr {Y,j > 0) - E Pr (Y^j, > 0 f] Y,j, > 0) 

fee5./e[l:F'] ij65,/i,/2e[l:F'],(*./i)/(i./2) 


F'\N/M] 






















13 


> 


> 


E (T) (sF'^,{s)-^s^{Fr{Ks)f) 

S = 1 ^ ^ 


F' \N/M] 


TmI 


if 


S = 1 


rAi 
I M I 


1 - 


E Os^F'{f,{s)y 

I S=1 


f—1 
I M I 


> K 1- 


\N/M] j 2 ' M ' ^ 


K 


K 


> K\l- 


1 


s=l 
X 2K 


2{s-l) 


r—1 

I M I 


2rfi 


1 - 


2(if-s+l) 


r—1 

I M I 


> if 1 - 


f—1 

I M I , 
1 


1- 


f—1 

I M I 


' m' ^ 


K 


s=l 


K 


T¥T 


2(^<-l) 


1 - 




rivi 

I M I 


_ ] ( if (1 - ^ 

2 'm'I I l§] 


+ K{K-1) 


> K \ 1- 


> K \ 1- 


M 

1 





2\ (^-1) 


r—1 

I M I 




1 - 


( 21 ) 


(a) This is because every \Vk^s-k \ is a sum of F' independent Bernoullli random variables and the set of Bernoulli 
variables across different values of k are independent because the demands are distinct, (b) We use the following 

( n \ n 

U ) — E i^i) ~ E n ^j)- i® defined in (fTSl l and Yij^ and Yij^ 

2 = 1 / 2=1 i^j 


are independent if (i, /i) ^ {i, / 2 ). (d) We use: E = 

S>1 ^ 

for p > 0 and further simplification, (e) We use: 1 — a; < exp(—a:), Vx > 0. 


4 Pi(l+P)" and(l+p) 


K-2 


< (i+p) 


K-1 


This implies that when F' < 


1 - 


[N/M] 


exp (2t (l — -jFj (l — f)), the expected number of normalized 
transmissions for Algorithm |2] for distinct requests under the new placement scheme given by Algorithm [3 is at 
least - ^) K. m 

Therefore, there is very little coding gain if we do not have exponential number of file packets (exponential in 
t). 


B. Requirements for Algorithm^ under Old Placement 

Let RJ^'^ (C, d) denote the normalized number of transmissions for the new delivery scheme. Let 1^^ ^ be the 
indicator random variable that bit / of file dk is stored exactly in user caches in the set 5 C [1 : A']. When k € S, 
let us define: 

ii4.5-fei = y if”/ 

/=! 


( 22 ) 
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Here, 


,S-k 

'-dkj 




K-\S\ + 1 


= ^'(|<S|). Consider the case when user demands are distinct 


(implicitly N > K). The following theorem shows that the coding gain is at most 2 even when the file size is 
exponential in the targeted gain t = 

Theorem 5. Let N > K. Then, E,„^ (C, d„)] > i (f - f) iT when F < ^ (l - exp {2t (l - j^) {l - j^)). 

Proof: We have the following chain: 

E 


5^0 


= ^E 

5/0 


max|14,5_fc| 

feG5 


F 


max 'W if H 


/=! 


F 




Pr U 
Vfce5,/G[1:F] 


It/ > 0 


5/0 


^E 

5/0 

K 


F 


E Pt 

fen-FI ^ 


fcG5,/G[l:F] 


it/>0'- 


E Pr 

*je5./i./2G[l:F].(z,/i)/0-,/2) 


(ifa>onit/2>o 


F 


> 




F 


b.f, M 




> K \ 1- 


M 

N 


2\ (^-1) 




> K il —— FKtexp f —2t ( 1 — 


M 


, , 1- 

K / \ K 


(23) 


Justifications are : 

(a)- Since the demands are distinct, when i ^ j, di f dj. Therefore the corresponding indicators are independent. 
Therefore, Pr > 0P| lt /2 > o) = Pr (^tA ^ (^t /2 ^ P^is probability is easily seen 

to be (fj,'(s))^. When i = j and /i f f^, we have: 

1 S—1 

Pr ( lf-|. > 0 n i!E' > o') = u'(s) f (1 - 


N - - 


M 


N - - 


K-s+1 


< (.d'is)y 


( 24 ) 
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(d) is because J < M < N. Step (a) (and its justification) in the above chain of inequalities is the main 
difference between old placement (Cop) and new placement (c„p). 

(b) - This follows the exact same derivation as in the proof of Theorem |4] except for the factor [iV/M]. 

(c) - This follows from: K + < 2Kt. 

This implies that when F < ^ — pvTAff) “ f) ~ f))' expected number of normalized 

transmissions for Algorithmic for distinct requests under the old placement scheme given by Algorithm [T] is at least 
i (l — K. This implies that there is very little coding gain (t = K^) even when we have file size exponential 
in t. ■ 


C. Requirements for any Clique Cover Delivery Scheme 

Let Cup denote a random independent and symmetric placement algorithm that has the following properties: 

1) For any packet (n, /), the probability of placing this in a user cache k is independent of placing it in all other 
caches. 

2) Placing of packets belonging to different files in the same cache is independent. 

3) The probability of placing a packet equals M/N for a given cache. 

Now, we have the following result on any clique cover scheme on the side information graph induced by random 
caching algorithm Cup and a unique set of demands d„. 


Theorem 6. When user demands are distinct, for any clique cover algorithm on the side information graph induced 

by the random cache configuration due to Cup, if^c„ {R{C,du)) < k(i-m/n) g > 2, then we need the 

2 ^ ^ 

number of file packets F > ^ where t = KM/N. Clearly, these bounds apply to both Cop and Cnp- 


Proof: We show this by contradiction. Let us assume that (i?(C,d„)) < m/n) ^ implies: 
Pi'cup > i (by Markov’s Inequality). The number of transmissions i? (C, du) < 

implies that there is at least there is one clique of size g in the side information graph G induced by C and d„. 
Given cache configuration C and distinct demands d„, let Ug denote the number of distinct cliques of size g. So 
we have the following chain of inequalities: 

Prc (R(C,du) < — M/N) \ ^ (there is one clique of size g) 

V 9 J 

< Ecpp (%) 


b 

< 


c 

< 


< 


< 



( 25 ) 
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When F < ^ and g > 2, then probability given by ( l25b is strictly less than 1/4 contradicting the 

assumption. Therefore, the desired implication follows. Justifications are: (a) Pr {X > 1) < ]E[X]. (b) There are 
(^) ways of choosing g users caches. Since all demands are distinct, there are F® ways of choosing g file packets 
belonging to the files requested by the chosen users. [M/NY ^ is the probability that a file packet wanted by one 
of the users is present in g — 1 other user caches. Since the demands are distinct and placement of packets belonging 
to different files are different, the probability of forming a p-clique is given by (c) (^) < ■ 

■ 

Note: We would like to note that Cup represents a broad set of schemes where every file packet is placed in a 
cache independently of its placement elsewhere and no file packet is given undue importance over other packets 
belonging to the same file. 


V. Efficient Achievable Schemes 
A. Deterministic Caching Scheme with User Grouping: 

Now, briefly we would like to explore what can be said about the file size requirements of deterministic placement 
schemes. In this section, we describe a variation of the deterministic caching scheme in that requires a similar 
file size requirement as the previous section for a target gain of g. However, it is not clear if, for a clique cover 
scheme at the delivery stage, this is the best one can do with deterministic caching schemes. In other words, a 
lower bound for deterministic caching scheme similar to the one above is not known. 


Now, we give a description of a deterministic caching and delivery scheme that requires F = 


(K\ 


packets 


to get a gain of p + 1. This follows directly from the deterministic scheme of El. For ease of exposition we 
describe it here: For every file, split the file into packets. For every subset G C [1 : AT] such that |G| = g, 
we place the corresponding packet in the user caches in the subset G. The total number of files per user cache 

(K-i\ 

is ^ < M. This satisfies the memory constraint because the gain g < KM/N. Following the same 

\ g ) _ 

arguments in iflBll . it is easy to show that the peak transmission rate is at most : ■ 

Now, we show a slight modification of the deterministic caching scheme mentioned above which (approximately 
order wise) matches the lower bound in the previous section. Fet us divide the users into groups of size K' = 
g\N/M^ and then apply the caching and delivery scheme for each group separately. The number of file packets 
required is F = (^ ). The memory constraint would be satisfied when g < K’M/N = g\N/M~\ {M/N) which is 
true. Now, coded multicasting is done within every user group. The total number of transmissions is: — ^ — 


— - 
3+1 V 


K' g+1 


\N/M] 


. This requires = O {{\N/M~\eY) packets. 


B. New Randomized Delivery scheme 

For the deterministic scheme described previously, similar to the one in ifTSl , it is necessary to refresh (possibly) all 
the caches in a specific way when users leave or join the system that requires coordination among the caches. Now, 
we show that under an uncoordinated random caching scheme given by the new placement scheme in Algorithm [3 
and a new randomized clique cover algorithm, it is possible to have an average peak rate (with respect to all the 






17 


randomness) of about when F = O [g{^) First, we introduce the new randomized delivery algorithm 

that we use to prove the above assertion. The new randomized delivery algorithm has a preprocessing step, that we 
call the ‘pull-down phase’, in addition to Algorithmic] 


Input: Parameters K, M, N, g and F, caches for all users fc € [1 : iT] and demand set d = [di, ^2 • ■ • (Ik]- 
Let Sdkj C [1 : iT], Vfc S [1 : iT], / G [1 : F] be the exact subset of users in which the /-th packet of file 
requested by user k is stored, 
for (dfc,/) G [1 : F] X [1 : F] do 

if \Sdkj\ >5 + 1 then 
I ^dkj ^ a random p-subset of Sd^j 

end 

end 

Run Algorithm Id with this new cache configuration. 

Algorithm 5: ModihedDelivery 

Remark: Algorithm |5] emulates a virtual alteration of the cache configuration. The change in Sd^j happens in 
such a way that the algorithm pretends that a file packet is being stored in a subset of a set of caches where it 
has been actually stored. We use the same notation Sd^j to represent such a ‘virtual cache configuration’ that will 
be used for the delivery. For example, if a particular packet was stored in caches {1, 2,3,4, 5, 6} and if g = 3, a 
random subset from this is chosen. So the resultant virtual cache configuration could be {1,2,3} after this virtual 
re-assignment. The re-assignment phase is what we call the ‘pull down’ phase. This will allow us to ‘target’ the 
gain g (which is typically a lot lesser compared to t = KM/N) more effectively if we use Algorithmic for delivery. 

Let d) be the random number of transmissions under Algorithm|5]given a fixed cache configuration C and 

demand pattern d. In this case, there is further randomness that is a part of the delivery phase. Let d)) 

denote the expected number of transmissions with respect to the randomness in Algorithm |5] 

We need the following lemma from ini (see proof of Theorem 1). 

Lemma 2. HI TV Consider m balls being thrown randomly uniformly and independently into n bins. When m = 
r{n)n\ogn where r{n) is O {(log (n))^) for some positive integer p, then maximum number of balls in any bin is 
at most r{n) logn(l -f with probability at least 1 — 

Theorem 7. Using the randomized Algorithm \^for the placement scheme and the randomized Algorithm \^for 
delivery, for any set of demands d, the average peak rate, with respect to all the randomness (randomness in both 
delivery and placement ) is given by d)) < + o(l)) and the number of file packets needed is 

F = 0 ((f)(log((^)))2riV/Ml) when 2<g< \N/M] < > K. 

Proof: According to the placement scheme given by Algorithm |C every file is made up of F' groups of file 
packets. Each group has size \N/M~\. Let us consider the j-th packet of every group. There are F' such file packets. 
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We will first analyze assuming that algorithm |3 uses only the F' hie packets formed by considering only the j-th 
packet from every group. We will hnally add up the number of transmissions for every set of F' packets formed 
using the differently numbered packet (for all j G [1 : \N/M~\]) from every group. Clearly, this is suboptimal. 
Therefore, this upper bounds the performance of Algorithm |5] 

Consider a hie n. Let G" be the set of F' packets, each of which is the j-th packet from every group of 
hie n according to the groups formed during placement algorithm [3] Let Snjj L [1 : A'] be the subset of 
user caches where the /-th packet in G" is stored. Here, 1 < f < F' indicates the position among F' packets 
formed by taking the jth packet from very group. Given a user cache k, the placement of packets from the 
set G" are mutually independent of each other. The marginal probability of placing it is given by The 

placement is also independent across caches. Therefore, the number of user caches in which a particular packet 
in G” is placed is a binomial random variable Bi (^K, py/mt) where Bi {m,p) is a binomial distribution with m 
independent trails each with probability p. Therefore, by chernoff bounds (see Pg. 276 ifTSlU . Pi'(|S'„./j | < 5) < 
exp ^ ^ < exp Here, we have used the fact that g < gp^^gp- Therefore, for 

any j (by Markov’s Inequality), 


Pr > 3A'(p + l)K\N/M^ exp 


4a: \ \ 1 

9\N/M]) I - 3(p + l)A:riV/M] 


(26) 


\N/M] < 


K 


¥ log if 


and g < 


K 

SfAf/M] 


implies the following condition (which can be verihed by algebra): 




(27) 


[g+DK rjv/Ml<exp(^jp^j. 

If a hie bit is stored in p caches, then the hie packet is said to be on level p. This implies, that with high probability, 
^1 — 3(p + l)Ar[iV/M] exp )) packets belonging to hie n from G” is stored at a level above or 

equal to g. We will hrst compute the number of transmissions due to applying Algorithm |5] only on the hie packets 
in {dkJ,j}i<k<K,feli-.F'] for a particular j. 

We start by considering a hxed demand pattern d = {di,d 2 ■ ■ .dir}. Applying union bound with (l26l) over at 
most K hies in the demand d, we have: 


Now, consider Algorithm |5] The hrst few steps of the algorithm, denoted henceforth as ‘pull down’ phase, brings 
every hie packet stored above level g to level g. Consider a hie packet {dk, f,j) before the beginning of Algorithm|5] 
Given that the packet {dk, f,j) is at a level above g, after the ‘pull down’ phase, the probability that it occupies any of 
the (^) subsets is equal. This is because prior to the pull down phase, the probability that the hie packet being stored 
in a particular cache is independent and equal to Consider the F' hie packets {{dk, f,j)}, 1 < / < A'. 

Clearly, the probability of any one of them (say {dk, f,j)) occupying a given set of g caches, after the pull down 
phase, is independent of the occupancy of all other hie packets {{dk, f\j)} f^f- Let j, ^ denote the occupancy 
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after the pull down phase. Therefore after the pull down phase, which is applied only to the files in the demand 
vector d, 

Pr {SIjj = B\ \SIjJ > g, VB C , kG[l:K], 1<J< \N/M] (29) 

After the pull down phase in Algorithm |5] we compute the number of transmissions of Algorithm |4] using the 
modified ^ j after the pull down phase. It has been observed that Algorithm|4]is equivalent to Algorithmic After 
the pull down phase, all the files packets are present at file level g or below. Let us set F' = c(^) ^log((^))^ 
for some constant c > 0. After the pull down phase, let Vlg_f. be the set of file packets in requested by user 
k but stored exactly in the cache of users specified by S — k. With respect to only the file packets IJ Gj’’, the 
number of transmissions of Algorithm |4] is given by: 


No. of trans(j) = ^ 
5^0 




F' 


= E 

S#0,|5|<g+1 




F' 


= E 

|S|=g+i 


maxlV'/o 1.1 

+ E 




F' 


|S|<9 


F' 


(30) 


(a)- This is because after the pull down phase, all the relevant file packets are at a level at most g. Consider the 
event E that b = (^1 — 3{g + 1)K\N/M~\ exp F' bits of G^* for all i are stored at a level above 

g before the beginning of Algorithm |5] Conditioned on this event being true, by ( |29] |, the pull down phase is 
equivalent to throwing b balls independently and uniformly randomly into bins. Using dZTl i and the fact that 
F' = c(^)(log((^)))^, the pull down phase is akin to throwing m = {l — i{g + l)K\N/M'\ exp )F' > 

c ^1 — ^ logn(nlogn) balls into n = bins. In fact, the m balls of file dk are being thrown independently 

and uniformly randomly into bins satisfying 5 — /c : |iS| = p + 1, k G S. We apply, Lemma|2]for a particular user 
k to obtain: 


Pr 


max 


\^k,S-k\ 


> 


(l + 0(i5^)) 


E < 


S:|S|=g+i, fees F' F' ' j ^Ky 


(31) 


Please note that r(n) as in Lemma |2] is O(logiT). Now, applying a union bound over all users k to OTT ). we have: 


Pr I 3fc G [1 : AT] : max 


S:|S|=g+i, fees F' 


\Ks-k\ ^ n (l + Q(uiw)) 


F' 


E < 


K 

IKV 


( 32 ) 


This implies that all Vk,s-k are bounded in size. Therefore, we have the following: 


1 - 






<Pr ^ 


JS|=g+i 


F' 


< 


K 

3 + 1 


(i + Q(e|7?)) 

F' 


\E 
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“PM E 

V|S|=3+1 

3(g+l) 


, A'-s 


F' 


< 


5 + 1 


1 + 0 


\ogK 


\e\. 


(33) 


(a) is because: 1 > > 1 - implying ^(1 + o(iogK) ) = (1 + o(iosK) )- Putting together ((331), ^ and 

]l, we have: 


Pr ( No. of trans(j) < 


K-g 
5 + 1 


1 + 0 


\ogK 


^ N iK . 

+ 2K^\—]e ^nwMT ) > ( 1 - 


1 


2(5 + l)riV/Ml 


1 - 


(g) 

(34) 


Union bounding over all 1 < j < \N/M~\, we have: 


Pr ( 3j : No. of trans(j) > ——- (1 + 0 

5+1 


/ 1 


V logic 


+ 3(p + l)iC2[^]e 9WM1 ) < 


\N/M] 


3(5 + 1) ^Ky 


(35) 

From dZTl i. we have 3{g + < 3. Now combining transmissions for different j and normalizing 

by \N/M~\, we have: 


Pr-J (i?-"(C,d) > :^(l + o(l)) ) < 


1 


\N/M] 


1 


Oil/K) 


(36) 


3(5 + 1 ) (^^^ 3(p + l) 

In the above bad event, the number of transmissions (normalized) needed is at most K. Therefore, we have: 

< +"<"■ <>>1 S f^U + »(l))(l- ^)-OWK)) + ( 5 ^ + 0(1/A)) A < 1 jfyd + 0(1)) 

(37) 


C. Grouping into smaller user groups: approximately achieving the lower bound 

We now propose a user grouping scheme similar to the one for the deterministic caching scheme which can 
achieve the same average number of transmissions as the scheme mentioned in the previous section but with 
improved file size requirement almost matching the lower bound. 

We group users in groups of size K' = [7V/M]3p(log(7V/M)) and apply the new placement scheme (Algorithm 
O and delivery scheme of Algorithm |5] to each of the user groups. It can be seen that K' satisfies the conditions: 
e < [W/M] < and 7 < g < ^ ' Therefore, Theorem |7] is applicable. For every 

group, the average number of transmissions for a particular demand configuration is at most (1 + o(l))- 

Adding over all groups, we have the following theorem: 

(JV )2 

Theorem 8. Let the placement scheme be that of Algorithm |5] For any target gain 7 < g < 3 io^n/m) 
\N/M~\ > e, let the number of users in the system be such that K is a large multiple of [■^]3plog(W/M). Consider 
the case when users are divided into groups of size K' = \N/ M^3g\og{N/ M) and delivery scheme of Algorithm 
|5] is applied to each user group separately. For any demand pattern, the expected total number of transmission 
required for all users is at most (1 + o(l))- The file size needed is F = 0((^ )(^°S((^ )))^ T-^/-^]) ~ 

(3e)» (log(iV/M))9+V)- 
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Note: The constant e in the above requirement for file size comes due to bounding (^) by (x)^- Other constants 
in the derivation can be relaxed if (|26]) can be strengthened which we do not do here. If N/M = Q{K^) for some 
0 < (5 < 1 and K large, then for a constant gain g, the above result requires O packets whereas the 

previous best known uncoordinated random caching schemes require a file size of n(exp(A"t“^)) for obtaining a 
gain of 2. 


VI. Conclusion 

We have analyzed random uncoordinated placement schemes along with clique cover based coded delivery 
schemes in the finite length regime for the caching-aided coded multicasting problem (or coded caching problem). 
This problem involves designing caches at user devices offline and optimizing broadcast transmissions when requests 
arise from a known library of popular files for worst case demand. The previous order optimal results on the number 
of broadcast transmissions for any demand pattern assumed that the number of packets per file is very large (tending 
to infinity). We showed that existing random placement and coded delivery schemes for achieving order optimal 
peak broadcast rate do not give any gain even when you have exponential number of packets. Further, we showed 
that to get a multiplicative gain of g over the naive scheme of transmitting all packets, one needs 0{{N/MY) 
packets per file for any clique cover based scheme where N and M are the library size and cache memory size 
respectively. We also provide an improved random placement and delivery scheme, that achieve this lower bound 
approximately. 

Future interesting research directions, to go beyond the bounds derived in this paper, may include designing 
improved deterministic caching schemes. This leads to several interesting research questions on designing very effi¬ 
cient coordinated deterministic placement schemes that go beyond the current ones and possible use of interference 
alignment inspired delivery schemes (instead of simple clique cover based delivery) that optimize the file size. 
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